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The horizontal visibility algorithm has been recently introduced as a mapping between time series 
and networks. The challenge lies in characterizing the structure of time series (and the processes 
that generated those series) using the powerful tools of graph theory. Recent works have shown that 
the visibility graphs inherit several degrees of correlations from their associated series, and therefore 
such graph theoretical characterization is in principle possible. However, both the mathematical 
grounding of this promising theory and its applications are on its infancy. Following this line, here 
we address the question of detecting hidden periodicity in series polluted with a certain amount of 
noise. We first put forward some generic properties of horizontal visibility graphs which allow us to 
define a (graph theoretical) noise reduction filter. Accordingly, we evaluate its performance for the 
task of calculating the period of noisy periodic signals, and compare our results with standard time 
domain (autocorrelation) methods. Finally, potentials, limitations and applications are discussed. 

PACS numbers: 



I. INTRODUCTION 

In the last years, some methods mapping time series to network representations have been proposed (see for instance 
[Xu et al. 2008, Zhang & Small 2006, Lacasa et al. 2008, Luque et al. 2009] and a recent review on this topic [Donner 
et al 2011]). The general purpose is to investigate on the properties of the series through graph theoretical tools 
recently developed in the core of the celebrated complex network theory, opening the possibility of building bridges 
between time series analysis, nonlinear dynamics, and graph theory. Along this line, the family of visibility algorithms 
[Lacasa et al. 2008, Luque et al. 2009, Gutin et al. 2011] has been introduced recently. It has been shown that 
several degrees of correlations (including periodicity [Lacasa et al. 2008] , fractality [Lacasa et al. 2009] or chaoticity 
[Luque et al. 2009, Lacasa & Toral 2010]) can be captured by the algorithm and translated in the associated visibility 
graph. Accordingly, several works applying such algorithm in several contexts ranging from geophysics [Eisner et al. 
2009] or turbulence [Liu et al. 2010] to physiology [Shao 2010] or finance [Yang et al. 2010] have started to appear. 
Here we focus on a specific algorithm within this family called the horizontal visibility graph [Luque et al. 2009], 
which has been recently considered for the task of discriminating chaotic from correlated stochastic processes [Lacasa 
& Toral 2010]. While the first steps for a rigorous mathematical grounding have been reported recently [Gutin et al. 
2011], this method is currently largely unexplored, both from a purely theoretical or from an applied point of view. To 
partially solve such issues, in this work we address the task of filtering a noisy signal with a hidden periodic component 
within the horizontal visibility formalism, that is, we explore the possibility of using the method for noise filtering 
purposes. Periodicity detection algorithms (see for instance [Parthasarathy et al. 2006]) can be classified in essentially 
two categories, namely the time domain (autocorrelation based) and frequency domain (spectral) methods. Here we 
make use of the horizontal visibility algorithm to propose a third category, namely graph theoretical methods. 
The rest of the paper goes as follows: in section 2 we present the method, and provide some theorems regarding 
several topological properties of horizontal visibility graphs. In section 3 we introduce the concept of a graph- 
theoretical noise filter, and provide some examples of noisy periodic series, comparing in each case the performance of 
the proposed method with an autocorrelation function analysis. A pathological case that yields misleading results from 
the autocorrelation function is also considered. We finally provide a discussion on the the potentials and limitations 
of this approach. 

II. HORIZONTAL VISIBILITY ALGORITHM 

The horizontal visibility algorithm has been recently introduced [Luque et al. 2009] as a map between a time series 
and a graph and it is defined as follows. Let {xi}i=i,...,iv be a time series of N real data. The algorithm assigns each 
datum of the series to a node in the horizontal visibility graph (HVG). Accordingly, a series of N data map to an 
HVG with N nodes. Two nodes i and j in the graph are connected if one can draw a horizontal line in the time series 
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joining xi and Xj that does not intersect any intermediate data height. That is, i and j are two connected nodes if 
the following geometrical criterion is fulfilled within the time series: 



Xj. x , 



> x n , Vn | i < n < j 



(1) 



Basic properties of this graphs can be found in [Luque et al. 2009], and the first steps for a rigorous mathematical 
characterization can be found in[Gutin et al. 2011]. Among possible applications of the method for time series 
analysis purposes, discrimination between chaotic and stochastic signals has been recently addressed [Lacasa & Toral, 
2010]. 

In this section we provide some theorems regarding some specific topological properties of the horizontal visibility 
graphs, and in the following sections we will rely on these theorems to define a noise filtering technique. 



Theorem II. 1 (Mean degree of periodic series) 

The mean degree of an horizontal visibility graph associated to an infinite periodic series of period T (with no repeated 
values within a period) is 

1 



KT)=4(1-- 

Proof Without lack of generality, represent the series as {..., xq, Xi, Xx, x±, x%, ...}, where xq = xt corresponds 
to the largest value of the series. By construction, the associated HVG is composed as a concatenation of identical 
motifs, each of these motifs being itself an HVG of T + l nodes associated to the subseries Xo, x\, Xt, and the mean 
degree of the HVG k corresponds to the mean degree of the motif constructed with T nodes (the nodes associated 
to xq and xt only introduce half of their actual degree in the motif, what is equivalent to effectively reducing one 
node). Suppose that the motif is a graph with V edges, and let Xi be the smallest datum of the subseries (since no 
repetitions are allowed in the motif, Xi will always be well defined). By construction, the associated node i will have 
degree k = 2. Extract now from the motif this node and its two edges. The resulting motif will have V — 2 edges and 
T nodes. Iterate this operation T — 1 times (see figured] for a graphical illustration of this process). The resulting 
graph will have only two nodes, associated to xo and xt, linked by a single edge, and the total number of deleted 
edges will be 2(T — 1). Hence, 
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FIG. 1: Graphical illustration of the constructive proof of theorem lII.il considering a motif extracted from a periodic series of 
period T = 5. 



An interesting consequence of the previous result is that every time series extracted from a dynamical system has an 
associated HVG with a mean degree 2 < k < 4, where the lower bound is reached for constant series, whereas the 
upper bound is reached for aperiodic series (random, chaotic @). 
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Theorem II. 2 (Degree distribution associated to uncorrelated random series) 

Let {xi} be a bi-infinite sequence of independent and identically distributed random variables extracted from a continous 
probability density f(x). Then, the degree distribution of its associated horizontal visibility graph is 

P{k) = l{l) ' fc = 2 ' 3 ' 4 '- ( 2 ) 

A lengthy constructive proof can be found in [Luque et al. 2009] . Here we propose two alternative, shorter proofs for 
this theorem. 

Proof Let x be an arbitrary datum of the aforementioned series. The probability of its horizontal visibility being 
interrupted by a datum x r on its right and other datum xi on its left is, independently of f(x), 

/OO />oo />GO f- OO -i 

/ / f(x)f(x r )f(x l )dx l dx r dx= f(x)[l-F(x)] 2 dx= - (3) 
-oo J x Jx J —oo ^ 

The probability P(k) of the datum seeing exactly k data may be expressed as 

P(k) = Q(k)$ 2 = ±Q(fc) (4) 
where Q(k) is the probability of x seeing at least k data. Q(k) may be recurrently calculated as 

Q(fc) = Q(fc-l)(l-$ 2 ) = ^Q(fc-l) (5) 
from which the following expression can be deduced: 



2 \ fe-2 

Q(k)=[^) (6) 



Proof (2) The same result for the distribution P{k) can be deduced by means of expression $2 along with combinatoric 
arguments: if the arbitrary datum x is connected to exactly k data, there exist two data xi > x and x r > x that 
close the left and right visibility of x. As we have proven, this happens with a probability $2 = 1/3. The k — 2 
remaining data will therefore be smaller than x and will be distributed in a monotonically decreasing sequence on 
its left and a monotonically increasing sequence on its right respectively. The number of possible distributions with 
i data on its left and k — 2 — i on its right is ( k l i 2 ), where i = 0, 1, 2, . . . , k — 2. All these configurations can all be 
decomposed in k — 2 groups of three data with the central datum being closed by the two others, therefore, all of 
them are equiprobable with a probability 4>\~ 2 , then 



Observe that the mean degree k of the horizontal visibility graph associated to an uncorrelated random process is 
then: 




(8) 



in good agreement with the prediction of the previous theorem for aperiodic series. 
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A. Stochastic, chaotic and periodic processes 



Deviations from P(k) = (-|)(|) fe_2 are, according to previous theorem, univoquely associated to series which are 
not generated by a purely uncorrelated process. Several possibilities arise: first, the process can still be of a stochastic 
nature, while some correlations can be present. As a matter of fact, in [Lacasa & Toral, 2010] it has been shown 
that such kind of correlated stochastic series map into HVGs with a degree distribution which is also exponentially 
decaying, albeit with a larger slope than equation^ A second situation involves a deterministic process. Two opposite 
possibilities arise: the process can be either regular, what yields a periodic series of a given period T, or chaotic, 
what yields an aperiodic series. Periodic series have an associated HVG with a degree distribution formed by a finite 
number of peaks, these peaks being related to the series period, what is reminiscent of the discrete Fourier spectrum 
of a periodic series [Lacasa et al. 2008, Luque et al. 2009]. The reason is straightforward: a periodic series maps into 
an HVG which, by construction, is a repetition of a root motif. The second possibility has been addressed in [Luque et 
al. 2009, Lacasa & Toral, 2010], the conclusion being that chaotic processes have an HVG whose degree distribution 
has an exponential tail with smaller slope than equation [2] and evidences a net deviation from the exponential shape 
for small values of the degree, this deviation being associated to short-range memory effects. Last, an interesting 
situation takes place when a given regular process (periodic series) is polluted with a given amount of noise. 
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FIG. 2: Left: Periodic series of 2 20 data generated through the logistic map x n +i = fix n (l — x n ) for fi = 3.2 (where the map 
shows periodic behavior with period 2) polluted with extrinsic white gaussian noise extracted from a Gaussian distribution 
A(0,0.05). Right: Dots represent the degree distribution of the associated HVG, whereas the straight line is equation [9] (the 
plot is in semi-log). Note that P{2) — 1/2, also as theory predicts, and that P(3) is not exactly zero due to boundary effects 
in the time series. The algorithm efficiently detects both signals and therefore easily distinguishes extrinsic noise. 



Indeed, if we superpose a small amount of noise to a periodic series (a so-called extrinsic noise), while the degree of 
the nodes with associated small values will remain rather similar, the nodes associated to higher values will eventually 
increase their visibility and hence reach larger degrees. Accordingly, the delta-like structure of the degree distribution 
(associated with the periodic component of the series) will be perturbed, and an exponential tail will arise due to the 
presence of such noise [Luque et al. 2009, Lacasa & Toral, 2010]. Can the algorithm characterize such kind of series? 
The answer is positive, since the degree distribution can be analytically calculated as it follows: 

Consider for simplicity a period-2 time series polluted with white noise (see the left part of figure [2] for a graphical 
illustration). The HVG is formed by two kind of nodes: those associated to high data with values (xi,xz,X5, ...) in 
the figure and those associated to data with small values (x2, X4,xq, ...). These latter nodes will have, by construction, 
degree k = 2. On the other hand, the subgraph formed by the odd nodes associated to data (x±, X3, X5, ...) will 
essentially reduce to the one associated to an uncorrelated series, i.e. its degree distribution will follow equation [2] 
Now, considering the whole graph, the resulting degree distribution will be such that 

P(2) = 1/2, 
P(3) = 0, 

1 /2\ fe ~ 2 
^• + 2) = -(-) ,*>2, 

1 /2\ k ~ 3 

orP(fc) = 2 (r) ,fc>4, (9) 
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that is to say, introducing a small amount of extrinsic uncorrelated noise in a periodic signal introduces an exponential 
tail in the HVG's degree distribution with the same slope as the one associated to a purely uncorrelated process. The 
mean degree k reads 

oo 

k = J2kP{k) = 4, 

k=2 

which, according to eauation lII.il suggests aperiodicity, as expected. In the left part of figure[2]we plot in semi-log the 
degree distribution of a periodic-2 series of 2 20 data polluted with an extrinsic white Gaussian noise extracted from 
a Gaussian distribution N(0, 0.05). Numerical results confirm the validity of equation[SJ Note that this methodology 
can be extended to every integrable deterministic system, and therefore we conclude that extrinsic noise in a mixed 
time series is, in principle, well captured by the algorithm. Based on the previous theorems, in the next section we 
introduce a method to calculate the hidden periodicity in a noisy periodic signal. 

III. A GRAPH-THEORETICAL NOISE FILTER 

A. Definition and examples 

Let S — {xi}i = i t , „ be a periodic series of period T (where n >> T) polluted by a certain amount of extrinsic 
noise (without loss of generality, suppose a white noise extracted from a uniform distribution U[— 0.5, 0.5]), and define 
the filter / as a real valued scalar such that / G [minxj, maxxi]. The so called filtered Horizontal Visibility Graph 
(f-HVG) associated to S is constructed as it follows: 

(i) each datum xi in the time series is mapped to a node i in the f-HVG, (ii) two nodes i and j are connected in the 
fHVG if the associated data fulfill 

Xi,Xj > x n + f, Vn | i < n < j . (10) 

The procedure of filtering the noise from a noisy periodic signal goes as follows: one generates the f-HVG associated 
to S for increasing values of /, and in each case proceeds to calculate the mean degree k. For the proper interval 
fmin < f < fmax, the f-HVG of the noisy periodic series S will be equivalent to the noise free HVG of the pure 
(periodic) signal, which has a well defined mean degree as a function of the series period. In this interval, the mean 
degree will therefore remain constant, and from equation III. II the period can be inferred. 

As an example, we have artificially generated a noisy periodic series of hidden period T = 2, that we plot in the left 
panel of figure |31 The results of the graph filtering technique are shown in the middle panel of this figure, where we 
plot the values of k as a function of /. Notice that the graph filtering yields a net decreases of the mean degree, which 
has an initial value of 4 (as expected for the HVG (/ = 0) of an aperiodic series such as a noisy periodic signal) and 
an asymptotic value of 2 (lower bound of the mean degree). The plateau is clearly found at k — 3, which according 
to equation III. 1 1 yields a period 

as expected. For comparison, the autocorrelation function ACF(t) of the series is also calculated, according to the 
following definition 

ACF(t) =< x(t)-x(t-T) > t , 

such that ACF is not bounded in [—1, 1] since it is not normalized. This expression has a periodic shape of period 
T when the series is itself periodic with period T, whereas aperiodic structures yield an autocorrelation function 
that lacks any structure. In the right panel of figure [3] we plot the values of the autocorrelation, showing a period-2 
structure as expected. 

At this point we conclude that the noise filtering is yet another feature of standard time series analysis that can be 
recovered in the visibility theory. An example with a noisy periodic series of period T = 5 is plotted in figure |4j 

B. Noisy periodic versus chaotic 

The autocorrelation function is an extremely useful tool to unveil periodic structures in noisy data, in this sense 
the aforementioned filter is not meant to be used instead an autocorrelation analysis, but rather as a complementary 
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FIG. 3: Left: Periodic series of period T = 2 polluted with extrinsic noise extracted from a uniform distribution U[— 0.5, 0.5] 
of amplitude 0.1. Middle: Values of the HVG's mean degree k as a function of the amplitude of the graph theoretical filter. 
The first plateau is found for k = 3, which renders a hidden period T — (2 — fc/2) -1 = 2. The second plateau corresponding to 
k — 2 is found when the filter is large enough to screen each datum with its first neighbors, such that the mean degree reaches 
its lowest bound. Right: Autocorrelation function of the noisy periodic series, which is itsef an almost periodic series with 
period T = 2, as it should. 
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FIG. 4: Left: Periodic series of period T = 5 polluted with extrinsic noise extracted from a uniform distribution U[— 0.5, 0.5] 
of amplitude 0.05. Middle: Values of the HVG's mean degree k as a function of the amplitude of the graph theoretical filter. 
The first plateau is found for k = 3.6, which renders a hidden period T — (2 — k/2)^ 1 = 5. Right: Autocorrelation function of 
the noisy periodic series, which is itsef an almost periodic series with period T — 5, as it should. 

study. As a matter of fact, in specific situations it may happen that an autocorrelation analysis may provide misleading 
results. This is for instance the case of chaotic maps with disconnected attractors. Consider the well known Logistic 
map 

x t +i = l±x t (\ - x t ), 

with fj, e [0,4] and x G [0,1]. This map generates periodic series for \i < ^ — 3.569..., while for fx > fioo the 
map generates chaotic (deterministic and aperiodic) series (besides regions where the orbit turns regular again, called 
islands of stability [Peitgen et al. 1992]). In the chaotic region, the chaotic attractor is the whole interval [0, 1] only 
for [i — 4. Concretely, for [i e [3.6, 3.67] the attractor is partitioned in two disconnected chaotic bands, and the 
chaotic orbit makes an alternating journey between both bands. In figure [S] we have plotted a time series of 2 18 data 
generated through the Logistic map at /x = 3.65. Note that the map is ergodic, but the attractor is not the whole 
interval, as there is a gap between both chaotic bands. In this situation, the chaotic series is by definition not periodic, 
however, an autocorrelation function analysis indeed suggests the presence of periodicity (see the left panel of figure 
[5]), what is reminiscent of the disconnected two-band structure of the attractor. Interestingly enough, applying the 
aforementioned noise filter technique, at odds with the autocorrelation function, the results suggests that the method 
does not find any periodic structure, as it should (middle panel of figure |6|). Furthermore, information of both the 
phase space structure and the chaotic nature of the map becomes accesible from an analysis of the HVGs degree 
distribution (plotted in semi-log scale in the right panel of figure|51): first, we find P(2) = 1/2, that indicates that half 
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FIG. 5: Left: Series extracted from the Logistic map at n — 3.65, where the map is chaotic and the attractor is partitioned in 
two disconnected chaotic bands. Right: Same plot as the left panel, for the first 75 values of the series. 
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FIG. 6: Left: Autocorrelation function of the chaotic series plotted in figure [5] which suggests that the series has a periodic 
component. This misleading result is consequence of the orbit within the attractor, more specifically to the alternated visit to 
the disconnected bands in the chaotic attractor. Middle: Values of the HVG's mean degree k associated to the same chaotic 
series, as a function of the amplitude of the graph theoretical filter. No plateau is found, what suggests that the series lacks 
any periodic structure, as it should. Right: Degree distribution P(k) of the HVG associated to the chaotic series, in semi-log 
scale. P(2) — 1/2, what is reminiscent of the attractor structure and the order of visits to chaotic bands (half of the nodes 
correspond to data located in the bottom chaotic band, that by construction has degree k = 2). The tail of the distribution is 
exponential with a slope that deviates from the distribution associated to a purely uncorrelated process, what is an indication 
of a chaotic process, according to a previous study on HVGs [f|. 



of the data are located in the bottom chaotic band, in agreement with the alternating nature of the chaotic orbit. 
This is reminiscent of the misleading result obtained from the autocorrelation function. Second, the tail of the degree 
distribution is exponential, with an asymptotic slope smaller than the one obtained rigorously (theorem III. 21 and [8() 
for a purely uncorrelated process. This is, according to a recent study on HVGs [Lacasa & Toral, 2010], characteristic 
of an underlying chaotic process. 



IV. DISCUSSION 



In this work we have outlined some properties of the HVGs associated to time series extracted from dynamical 
systems, and have accordingly proposed a method to detect periodicity in signals polluted with noise. The results 
suggest that the HVG correctly inherits the hidden periodicity of noisy signals, and can be retrieved by making use 
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of the aforementioned filter in situations where the noise level is not very large (indeed, the maximum power of the 
noise is of order 0(Ax 2 ), where Ax = min{|a;j — Xj\}ij). 

Also, we have found that specific pathological cases where a classical time series analysis yields misleading results, 
such as in chaotic (aperiodic) series generated from chaotic maps with a disconnected attractor, can be efficiently 
analyzed within this network-based tool. This approach is radically different from traditional methods for time 
series analysis since it is based on graph theoretical properties, and therefore can be used as a complementary tool 
in practical situations. We have deliberately not tackled the task of systematically comparing the goodness of such 
graph theoretical method with other standard tools of time series analysis, and have restricted this comparison to 
checking that an autocorrelation function analysis provides equivalent results for the cases addressed here. In this 
sense, we must emphasize that the goal of this work is to push forward the state of the art in the visibility theory 
providing mathematically sound properties, rather than putting the practical usefulness of both methods in direct 
competence. 
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